On optimal multiple changepoint algorithms for large data

Authors

  • Robert Maidstone
  • Toby Hocking
  • Guillem Rigaill
  • Paul Fearnhead
Abstract

Many common approaches to detecting changepoints, for example those based on statistical criteria such as penalised likelihood or minimum description length, can be formulated in terms of minimising a cost over segmentations. We focus on a class of dynamic programming algorithms that can solve the resulting minimisation problem exactly, and thus find the optimal segmentation under the given statistical criteria. The standard implementations of these dynamic programming methods have a computational cost that scales at least quadratically in the length of the time series. Recently, pruning ideas have been suggested that can speed up the dynamic programming algorithms whilst still being guaranteed to be optimal, in that they find the true minimum of the cost function. Here we extend these pruning methods and introduce two new algorithms for segmenting data: FPOP and SNIP. Empirical results show that FPOP is substantially faster than existing dynamic programming methods and that, unlike the existing methods, its computational efficiency is robust to the number of changepoints in the data. We evaluate the method for detecting copy number variations and observe that FPOP has a computational cost that is even competitive with that of binary segmentation, but can give much more accurate segmentations.

Electronic supplementary material: The online version of this article (doi:10.1007/s11222-016-9636-3) contains supplementary material, which is available to authorized users.

Author affiliations:

  1. STOR-i Centre for Doctoral Training, Lancaster University, Lancaster, UK
  2. McGill University and Genome Quebec Innovation Center, Quebec, Canada
  3. Institute of Plant Sciences Paris-Saclay, UMR 9213/UMR1403, CNRS, INRA, Université Paris-Sud, Université d’Evry, Université Paris-Diderot, Sorbonne Paris-Cité, Paris, France
  4. Department of Mathematics and Statistics, Lancaster University, Lancaster, UK
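
To make the penalised-cost formulation in the abstract concrete, here is a minimal Python sketch of the standard optimal partitioning recursion with inequality-based (PELT-style) pruning. It is not FPOP or SNIP, and it is not the authors' implementation. It assumes a change-in-mean model with a squared-error segment cost and a fixed penalty per changepoint; the names `optimal_partitioning`, `beta` and `prune` are illustrative choices, not taken from the paper.

```python
from itertools import accumulate


def optimal_partitioning(y, beta, prune=True):
    """Minimise (sum of segment costs) + beta * (number of changepoints).

    Assumptions (not from the paper): y is a non-empty list of floats,
    the segment cost is the squared error around the segment mean, and
    beta > 0 is a fixed penalty per changepoint.
    Returns (minimal penalised cost, sorted changepoint positions), where
    a changepoint at position t means a new segment starts at index t.
    """
    n = len(y)
    # Prefix sums of y and y^2 give any segment cost in O(1).
    s1 = [0.0] + list(accumulate(y))
    s2 = [0.0] + list(accumulate(v * v for v in y))

    def cost(s, t):
        # Squared-error cost of the segment y[s:t] around its own mean.
        d = s1[t] - s1[s]
        return (s2[t] - s2[s]) - d * d / (t - s)

    F = [0.0] * (n + 1)   # F[t]: optimal penalised cost of y[:t]
    F[0] = -beta          # so that the first segment carries no penalty
    last = [0] * (n + 1)  # last[t]: start of the final segment of y[:t]
    candidates = [0]      # candidate start positions of the final segment

    for t in range(1, n + 1):
        vals = [(F[s] + cost(s, t) + beta, s) for s in candidates]
        F[t], last[t] = min(vals)
        if prune:
            # PELT-style rule: if F[s] + cost(s, t) > F[t], then s can
            # never be the optimal last changepoint at any later time.
            candidates = [s for (v, s) in vals if v - beta <= F[t]]
        candidates.append(t)

    # Backtrack through the stored segment starts.
    cps = []
    t = n
    while last[t] > 0:
        t = last[t]
        cps.append(t)
    return F[n], sorted(cps)


if __name__ == "__main__":
    data = [0.0] * 5 + [10.0] * 5 + [0.0] * 5
    print(optimal_partitioning(data, beta=1.0))
```

With `prune=False` every earlier position remains a candidate, which gives the quadratic scaling mentioned in the abstract. With `prune=True` the candidate set tends to stay small when changepoints are frequent, while the returned minimum is unchanged. On the toy series in the usage example, both settings should place changepoints at positions 5 and 10.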

Similar resources

MINIMIZING HANKEL’S NORM AS DESIGN CRITERION OF MULTIPLE TUNED MASS DAMPERS

Tuned mass dampers (TMDs) have been studied and installed in structures extensively to protect the structures against lateral loads. Multiple tuned mass dampers (MTMDs), which include a number of TMDs with different parameters, have been proposed for improving the performance of single TMDs. When the structural system is considered as multiple degrees of freedom (MDOF) and implemented with MTMDs, t...

Analysis of statistical and standard algorithms for detecting muscle onset with surface electromyography

The timing of muscle activity is a commonly applied analytic method to understand how the nervous system controls movement. This study systematically evaluates six classes of standard and statistical algorithms to determine muscle onset in both experimental surface electromyography (EMG) and simulated EMG with a known onset time. Eighteen participants had EMG collected from the biceps brachii a...

An Optimal Model for Medicine Preparation Using Data Mining

Introduction: Lack of financial resources and liquidity are the main problems of hospitals. Pharmacies are one of the sectors that affect the turnover of hospitals and due to lack of forecast for the use and supply of medicines, at the end of the year, encounter over-inventory, large volumes of expired medicines, and sometimes shortage of medicines. Therefore, medicine prediction using availabl...

Single-Setup-Multiple-Deliveries for a Single Supplier-Single Buyer with Single Product and Backorder

  This article investigates integrated production-inventory models with backorder. A single supplier and a single buyer are considered and shortage as backorder is allowed for the buyer. The proposed models determine optimal order quantity, optimal backorder quantity and optimal number of deliveries on the joint total cost for both buyer and supplier. Two cases are discussed: single-setup-singl...

Journal:
  • Statistics and Computing

Volume 27

Publication date: 2017